Spatial data compression via adaptive dispersion clustering

نویسندگان

  • Yuliya Marchetti
  • Hai Nguyen
  • Amy Braverman
  • Noel Cressie
چکیده

In this article, we introduce a method of spatial data compression, which we call Adaptive Spatial Dispersion Clustering (ASDC). It is specifically designed to reduce the size of a spatial dataset in order to facilitate subsequent spatial prediction. Unlike with traditional data and image compression methods, the goal of ASDC is to create a new dataset that will be used as input into spatial prediction methods, such as traditional kriging or Fixed Rank Kriging, where using the full dataset may be computationally infeasible. ASDC can be classified as a lossy compression method and is based on spectral clustering. It aims to produce contiguous spatial clusters and to preserve the spatial correlation structure of the data so that the loss of predictive information is minimal. Through simulations, we demonstrate the predictive performance of these adaptively compressed datasets for several scenarios. ASDC is compared to two other data-reduction schemes, one using local neighborhoods and one using simple binning. We also present an application to remotely sensed sea-surface temperature data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Data Compression Model Based on Spatial Clustering and Principal Component Analysis in Wireless Sensor Networks

Wireless sensor networks (WSNs) have been widely used to monitor the environment, and sensors in WSNs are usually power constrained. Because inner-node communication consumes most of the power, efficient data compression schemes are needed to reduce the data transmission to prolong the lifetime of WSNs. In this paper, we propose an efficient data compression model to aggregate data, which is ba...

متن کامل

An Adaptive LEACH-based Clustering Algorithm for Wireless Sensor Networks

LEACH is the most popular clastering algorithm in Wireless Sensor Networks (WSNs). However, it has two main drawbacks, including random selection of cluster heads, and direct communication of cluster heads with the sink. This paper aims to introduce a new centralized cluster-based routing protocol named LEACH-AEC (LEACH with Adaptive Energy Consumption), which guarantees to generate balanced cl...

متن کامل

Clustering with Beta Divergences

Clustering algorithms start with a fixed divergence, which captures the possibly asymmetric distance between a sample and a centroid. In the mixture model setting, the sample distribution plays the same role. When all attributes have the same topology and dispersion, the data are said to be homogeneous. If the prior knowledge of the distribution is inaccurate or the set of plausible distributio...

متن کامل

Large Dataset Compression Approach Using Intelligent Technique

Data clustering is a process of putting similar data into groups. A clustering algorithms partition data set into several groups such that the similarity within a group is larger than among groups. Association rule is one of the possible methods for analysis of data. The association rules algorithm generates a huge number of association rules, of which many are redundant. The main idea of this ...

متن کامل

A Scene Adaptive and Signal Adaptive Quantization for Subband Image and Video Compression Using Wave - Circuits and Systems for Video Technology, IEEE Transactions on

Discrete wavelet transform (DWT) provides an advantageous framework of multiresolution space-frequency representation with promising applications in image processing. The challenge as well as the opportunity in wavelet-based compression is to exploit the characteristics of the subband coefficients with respect to both spectral and spatial localities. A common problem with many existing quantiza...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 117  شماره 

صفحات  -

تاریخ انتشار 2018